DRAFT VERSION JUNE 27, 2008, ACCEPTED FOR PUBLICATION IN THE ASTROPHYSICAL JOURNAL 
Preprint typeset using LaTeX style emulateapj v. 04/21/07 



A MOLECULAR EINSTEIN RING AT Z=4.12: 
IMAGING THE DYNAMICS OF A QUASAR HOST GALAXY THROUGH A COSMIC LENS 

DOMINIK A. RIECHERS 1,2 , FABIAN WALTER 1 , BRENDON J. BREWER 3 , CHRISTOPHER L. CARILLI 4 , 
GERAINT F. LEWIS 3 , FRANK BERTOLDI 5 , AND PIERRE COX 6 

ABSTRACT 

We present high-resolution (0.3") Very Large Array (VLA) imaging of the molecular gas in the host 
galaxy of the high redshift quasar PSS J2322+1944 (z = 4.12). These observations confirm that the 
molecular gas (CO) in the host galaxy of this quasar is lensed into a full Einstein ring, and reveal the 
internal dynamics of the molecular gas in this system. The ring has a diameter of ~1.5", and thus is 
sampled over ~20 resolution elements by our observations. Through a model-based lens inversion, we 
recover the velocity gradient of the molecular reservoir in the quasar host galaxy of PSS J2322+1944. 
The Einstein ring lens configuration enables us to zoom in on the emission and to resolve scales down 
to <1 kpc. From the model-reconstructed source, we find that the molecular gas is distributed on a 
scale of 5 kpc, and has a total mass of $M({\rm H_2}) = 1.7 \times 10^{10}\,M_\odot$. A basic estimate of the dynamical mass 
gives $M_{\rm dyn} = 4.4 \times 10^{10}\,\sin^{-2} i\,M_\odot$, that is, only ~2.5 times the molecular gas mass, and ~30 times 
the black hole mass (assuming that the dynamical structure is highly inclined). The lens configuration 
also allows us to tie the optical emission to the molecular gas emission, which suggests that the active 
galactic nucleus (AGN) does reside within, but not close to the center of the molecular reservoir. 
Together with the (at least partially) disturbed structure of the CO, this suggests that the system 
is interacting. Such an interaction, possibly caused by a major 'wet' merger, may be responsible for 
both feeding the quasar and fueling the massive starburst of 680 $M_\odot$ yr$^{-1}$ in this system, in agreement 
with recently suggested scenarios of quasar activity and galaxy assembly in the early universe. 
Subject headings: galaxies: active, starburst, formation, high redshift — cosmology: observations — 
radio lines: galaxies 

1. INTRODUCTION 

A fundamental aspect in studies of galaxy formation 
and evolution is to understand the connection between 
AGN and starburst activity. The existence of a physical 
connection between both processes is suggested by the 
finding that present day galaxies show a strong relationship 
between the mass of their central supermassive black 
holes (SMBHs) and the mass and concentration of their 
stellar spheroids (Magorrian et al. 1998; Ferrarese & Merritt 2000; 
Gebhardt et al. 2000; Graham et al. 2001). If 
these relations were due to a coevolution of both compo- 
nents during the early assembly of a galaxy, high-redshift 
quasars and their associated host galaxies would be ideal 
objects to study the active formation of both SMBHs 
and bulge stars. 

Studies of molecular gas (most commonly rotational 
transitions of CO), the prerequisite material that fuels 
star formation, have become an important tool to probe 
distant quasar host galaxies, and have revealed large molecular 
gas reservoirs of $>10^{10}\,M_\odot$ in a number of these 
sources (see Solomon & Vanden Bout 2005 for a general 
review). These galaxies typically show huge far-infrared 
(FIR) luminosities in excess of $10^{13}\,L_\odot$, which 
are thought to be powered by starbursts (and possibly a 
central AGN; e.g., Omont et al. 2001; Wang et al. 2008). 
Observations of molecular gas trace the regions that can 
host massive starbursts. In addition, the velocity struc- 
ture of molecular line emission has the potential to con- 
strain the dynamical state of galaxies out to the earliest 
epochs. 

Rotational molecular line emission typically emerges 
at FIR to radio wavelengths, i.e., in the limited wave- 
length regime where the AGN in distant quasars does 
not outshine all other emission. However, the cosmologi- 
cal distances of high redshift quasars make it difficult to 
resolve the faint emission from their host galaxies at such 
long wavelengths. The physical resolution of such obser- 
vations is in some cases boosted by gravitational lenses 
acting as natural telescopes. The gravitational lensing 
effect also magnifies the observed flux of the background 
galaxy, in particular for systems in Einstein ring config- 
urations. Due to the compactness of the AGN, optical 
quasars in Einstein ring lens configurations are rare. Due 
to their greater extent, the host galaxies of quasars are 
much more likely to cross the inner Einstein ring caustic 
of a gravitational lens. 

In this paper, we report on high (0.3") angular resolution Very Large Array (VLA)$^{8}$ observations of CO in the host galaxy of the z = 4.12 quasar PSS J2322+1944, one of only two known z > 4 galaxies that are both gravitationally lensed and detected in molecular gas emission (the other being BRI 0952-0115 at z = 4.43; Guilloteau et al. 1999). This galaxy was identified in a spectroscopic follow-up study of the Palomar Sky Survey (DPOSS; Djorgovski et al. 2000), and found to be a strongly lensed optical quasar (S. G. Djorgovski, private communication). It was subsequently detected in hard X-ray (Vignali et al. 2005), FIR dust (Omont et al. 2001; Isaak et al. 2002), and radio continuum emission (Carilli et al. 2001), as well as molecular line emission (Cox et al. 2002; Carilli et al. 2002). It follows the radio-FIR correlation of star-forming galaxies (Carilli et al. 2001; Beelen et al. 2006), indicating that its FIR continuum emission is dominated by intense star formation. Although this source shows only two unresolved quasar images in the optical, previous CO observations have shown that the molecular gas reservoir in its host galaxy is lensed into an Einstein ring (Carilli et al. 2003; hereafter C03). These observations were also used to derive a first lensing model for this source. Based on the dynamical structure revealed by our new, higher resolution observations of PSS J2322+1944, we have developed a new lensing model, which enables us to reconstruct the velocity gradient in the spatially resolved gas reservoir. We use a concordance, flat $\Lambda$CDM cosmology throughout, with $H_0 = 71$ km s$^{-1}$ Mpc$^{-1}$, $\Omega_{\rm M} = 0.27$, and $\Omega_\Lambda = 0.73$ (Spergel et al. 2003, 2007). 

$^{8}$ The Very Large Array is a facility of the National Radio Astronomy Observatory, operated by Associated Universities, Inc., under a cooperative agreement with the National Science Foundation. 

2. OBSERVATIONS 

We observed the CO($J$=2$\to$1) transition ($\nu_{\rm rest}$ = 230.53799 GHz) toward PSS J2322+1944 using the VLA in B configuration between 2006 June 19 and July 3, and in C configuration between 2002 October 21 and November 15 (these short-spacing data were published in the original study by C03). The total on-sky integration time in the 11 observing runs amounts to 70.5 hr. At z = 4.119, the line is redshifted to 45.0351 GHz (6.66 mm). Observations were performed in fast-switching mode (see, e.g., Carilli & Holdaway 1999) using the nearby source J23307+11003 for secondary amplitude and phase calibration. Observations were carried out under very good weather conditions with 26 or 27 antennas. The phase stability in all runs was excellent (typically <20° phase rms for the longest baselines). The phase coherence was checked by imaging a calibrator (J23207+05138) with the same calibration cycle as that used for the target source. For primary flux calibration, 3C48 was observed during each run. Due to the restrictions of the VLA correlator, one 50 MHz intermediate frequency (IF) with seven 6.25 MHz channels was centered at the CO($J$=2$\to$1) line frequency, leading to an effective bandwidth of 43.75 MHz (corresponding to 291 km s$^{-1}$ at 45.0 GHz). This encompasses almost the full CO line width as measured in the CO($J$=5$\to$4) transition ($\Delta v_{\rm FWHM} = 273 \pm 50$ km s$^{-1}$; Cox et al. 2002). Earlier observations set a 2$\sigma$ limit of 150 $\mu$Jy on the continuum emission at the line frequency (Carilli et al. 2002). 

Fig. 1. - VLA map of the CO($J$=2$\to$1) emission toward PSS J2322+1944 (integrated over the central 37.5 MHz, or 250 km s$^{-1}$). Contours are shown at (-3, -2, 1, 2, 3, 4, 5)$\times\sigma$ (1$\sigma$ = 48 $\mu$Jy beam$^{-1}$). The beam size (0.33" x 0.30") is shown in the bottom left corner. The large crosses show the positions of the quasar images at $\lambda_{\rm obs}$ = 1.6 $\mu$m ($\lambda_{\rm rest}$ = 314 nm), and the small cross shows the position of the lensing galaxy at the same $\lambda$. 

For data reduction and analysis, the AIPS package was used. All data were mapped using the CLEAN algorithm and natural weighting. The synthesized clean beam using all data has a size of 0.33" x 0.30".$^{9}$ The final rms over a bandwidth of 37.5 MHz (250 km s$^{-1}$, excluding the noisy edge channel) is 48 $\mu$Jy beam$^{-1}$. In addition, seven velocity channel maps (6.25 MHz, or 42 km s$^{-1}$ each) of the CO($J$=2$\to$1) emission were created. The rms after Hanning smoothing is 77 $\mu$Jy beam$^{-1}$. Convolving the data to a linear spatial resolution of 0.5" leads to slightly higher rms values of 49 and 80 $\mu$Jy beam$^{-1}$ for maps at 250 and 42 km s$^{-1}$ velocity resolution. 
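
The bandwidth-to-velocity conversions quoted in this section follow from the radio Doppler relation $\Delta v = c\,\Delta\nu/\nu_{\rm obs}$. The following is a minimal sketch of that arithmetic; the function name is illustrative and not taken from any reduction script used for the paper.

```python
C_KMS = 299792.458  # speed of light in km/s

def velocity_width(delta_nu_mhz, nu_obs_ghz):
    """Velocity width (km/s) spanned by a bandwidth delta_nu at frequency nu_obs."""
    return C_KMS * delta_nu_mhz / (nu_obs_ghz * 1.0e3)

nu_obs = 45.0351  # GHz, redshifted CO(J=2-1) frequency quoted in the text

# One channel, six channels (edge channel excluded), and all seven channels.
for bandwidth in (6.25, 37.5, 43.75):
    print(f"{bandwidth:6.2f} MHz -> {velocity_width(bandwidth, nu_obs):6.1f} km/s")
```

Rounded to integers, these widths correspond to the 42, 250, and 291 km s$^{-1}$ given above.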

3. RESULTS 

In Figure 1, the velocity-integrated CO($J$=2$\to$1) emission 
over the central 250 km s$^{-1}$ is shown. The emission 
is clearly resolved over multiple beams, and extended on 
a scale of ~1.5". The distribution of the gas is remi- 
niscent of a full, almost circular molecular Einstein ring, 
consistent with previous indications of such a structure in 
lower resolution observations (C03). The emission varies 
in intensity along the ring, showing clear substructure. 

The apparent surface brightness variations along the 
ring set strong constraints on the geometry of the lens 
configuration. The brightest CO peak on the ring can 
be used to set a lower limit on the intrinsic brightness 
temperature of the molecular gas. The peak strength 
of $280 \pm 48\,\mu$Jy beam$^{-1}$ corresponds to a beam-averaged, 
rest-frame brightness temperature of $T_b = 8.7 \pm 1.5$ K. This 
is a factor of a few lower than the kinetic gas temperature 
$T_{\rm kin}$ predicted from CO line excitation models 
of this source (Riechers et al. 2006; A. Weiß et al., in 
prep.), and indicates that the substructure of the molecular 
reservoir is not fully resolved by the observations. 
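
As a cross-check of the brightness temperature, the sketch below applies the Rayleigh-Jeans relation $T_b = c^2 S / (2 k \nu^2 \Omega)$ to the peak flux and beam size quoted above, and scales the result by $(1+z)$ to the rest frame. It assumes a Gaussian beam that is uniformly filled by the source; the function name is illustrative.

```python
import math

C = 2.99792458e8                      # speed of light, m/s
K_B = 1.380649e-23                    # Boltzmann constant, J/K
ARCSEC = math.pi / (180.0 * 3600.0)   # radians per arcsecond

def rest_frame_tb(peak_jy_per_beam, nu_obs_hz, bmaj_arcsec, bmin_arcsec, z):
    """Rayleigh-Jeans brightness temperature (K) of a beam-filling source."""
    # Solid angle of an elliptical Gaussian beam with the given FWHM axes.
    omega = math.pi * bmaj_arcsec * bmin_arcsec * ARCSEC**2 / (4.0 * math.log(2.0))
    s_si = peak_jy_per_beam * 1.0e-26          # Jy -> W m^-2 Hz^-1
    t_obs = C**2 * s_si / (2.0 * K_B * nu_obs_hz**2 * omega)
    return t_obs * (1.0 + z)                   # observed frame -> rest frame

tb = rest_frame_tb(280e-6, 45.0351e9, 0.33, 0.30, 4.119)
tb_err = tb * 48.0 / 280.0                     # scale the 1 sigma map noise
print(f"T_b = {tb:.1f} +/- {tb_err:.1f} K")
```

Under these assumptions the result agrees with the 8.7 ± 1.5 K quoted in the text.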

$^{9}$ Imaging B array data only gives a resolution of 0.17" x 0.15" (natural weighting), or 0.12" x 0.11" (uniform weighting); however, most of the emission is resolved out in such maps. 

We derive a spatially integrated CO($J$=2$\to$1) line peak flux density of 2.50 ± 0.32 mJy, which is consistent with that found by Carilli et al. (2002). Assuming constant $T_b$ between CO($J$=2$\to$1) and CO($J$=1$\to$0), and a CO luminosity to H$_2$ mass conversion factor for ultraluminous infrared galaxies [$\alpha = 0.8\,M_\odot$ (K km s$^{-1}$ pc$^2$)$^{-1}$; Downes & Solomon 1998], this corresponds to a total molecular gas mass of $M({\rm H_2}) = 9.0 \times 10^{10}\,\mu_L^{-1}\,M_\odot$ (where $\mu_L$ is the lensing magnification factor, see below). Within the uncertainties, this is in agreement with the value found by Riechers et al. (2006) based on the CO($J$=1$\to$0) luminosity. 
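
The apparent gas mass can be roughly reproduced from the quoted line parameters. The sketch below uses the standard line luminosity relation $L'_{\rm CO} = 3.25 \times 10^{7}\,S\Delta v\,\nu_{\rm obs}^{-2}\,D_L^{2}\,(1+z)^{-3}$ (Solomon & Vanden Bout 2005 form) in the adopted cosmology. Two points are assumptions rather than statements from the paper: the integrated line flux is approximated as peak flux density times FWHM, and the luminosity distance is integrated numerically with Simpson's rule. All function names are illustrative.

```python
import math

def luminosity_distance_mpc(z, h0=71.0, omega_m=0.27, omega_l=0.73, steps=2000):
    """Luminosity distance (Mpc) in flat LCDM; `steps` must be even (Simpson's rule)."""
    c_kms = 299792.458

    def inv_e(zz):
        return 1.0 / math.sqrt(omega_m * (1.0 + zz) ** 3 + omega_l)

    h = z / steps
    total = inv_e(0.0) + inv_e(z)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * inv_e(k * h)
    comoving = c_kms / h0 * total * h / 3.0
    return (1.0 + z) * comoving

def co_line_luminosity(flux_jy_kms, nu_obs_ghz, z):
    """L'_CO in K km/s pc^2 from the velocity-integrated line flux."""
    d_l = luminosity_distance_mpc(z)
    return 3.25e7 * flux_jy_kms * d_l**2 / (nu_obs_ghz**2 * (1.0 + z) ** 3)

z = 4.119
flux = 2.50e-3 * 273.0            # ASSUMPTION: peak (Jy) x FWHM (km/s)
l_co = co_line_luminosity(flux, 45.0351, z)
m_h2_apparent = 0.8 * l_co        # alpha = 0.8 Msun (K km/s pc^2)^-1
print(f"L'_CO = {l_co:.2e} K km/s pc^2")
print(f"M(H2) = {m_h2_apparent:.2e} / mu_L Msun")
```

With these inputs the apparent mass comes out within a few percent of the quoted $9.0 \times 10^{10}\,\mu_L^{-1}\,M_\odot$; the residual difference presumably reflects how the line flux was actually integrated.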

To increase the peak signal-to-noise ratio, Figure 2 shows the CO($J$=2$\to$1) emission convolved to 0.5" linear resolution. This contour map is shown overlaid on a greyscale image of the (rest-frame) 314 nm continuum emission of the source, as observed by the Hubble Space Telescope (HST/NICMOS-2 F160W; image adopted from Peng et al. 2006; C. Y. Peng 2008, private communication). The two brightest spots in the optical image are unresolved lensed images of the AGN (henceforth 'A' and 'B',$^{10}$ also indicated as large crosses in Figure 1). The optical emission is likely dominated by the broad-line region of the AGN, and thus expected to emerge from a compact circumnuclear pc-scale region. The quasar images have a brightness ratio of B/A = 0.181. They coincide within 0.10" with the positions measured by Keck at 430 nm (2.2 $\mu$m observed frame; C03), i.e., within the relative astrometric errors. These positions, however, are clearly offset from the brightest peaks of the molecular line emission. The third spot in the optical image is the lensing galaxy ('G', also indicated as a small cross in Figure 1). The lensing galaxy lies on the axis connecting images A and B, but is more than a factor of 2 closer to image B. It does, however, lie at the very center of the molecular Einstein ring, as expected. At the position of the lensing galaxy, we measure a 45.0 GHz radio continuum flux of $79 \pm 47\,\mu$Jy. This corresponds to only 1.7$\sigma$, and thus has to be considered tentative at best. 

In the adopted cosmology, the optical brightness of PSS J2322+1944 corresponds to an apparent bolometric luminosity of $L_{\rm bol} = 2.1 \times 10^{14}\,(\mu_L^{\rm opt})^{-1}\,L_\odot$ (Isaak et al. 2002). Assuming Eddington accretion, this corresponds to an apparent black hole mass of $M_{\rm BH} = 7.0 \times 10^{9}\,(\mu_L^{\rm opt})^{-1}\,M_\odot$. Note that this may be considered a lower limit, as black holes in z ~ 4 quasars are found to have typical accretion rates of $\dot{M} = L_{\rm bol}/L_{\rm Edd}$ = 0.3-0.4 (e.g., Shen et al. 2007). 

In Figure 3, seven 42 km s$^{-1}$ wide velocity channels of the CO($J$=2$\to$1) emission line are shown (channel 7 was omitted from the data shown in Figures 1 and 2 due to higher noise). Emission along the Einstein ring is detected in all channels. Clearly, the emission is moving systematically along the ring from the red part of the CO($J$=2$\to$1) line to the blue. Note that there are many peaks of similar surface brightness (peak fluxes of 450-550 $\mu$Jy beam$^{-1}$, or $T_b$ = 6-7 K) that are found at different positions at different velocities. This indicates, to first order, a dynamical structure of uniform surface brightness, where the components at different velocities get projected to different positions in the lens plane. This puts us in the unique position to model the gravitational lens configuration in this system in detail, and to recover the intrinsic dynamical structure of the quasar host galaxy in the background. 

$^{10}$ By convention, A is the brightest image. 

Fig. 2. - Contours of the CO($J$=2$\to$1) emission as shown in Figure 1, but convolved to a linear resolution of 0.5" and overlaid on a HST NICMOS-2 image at $\lambda_{\rm obs}$ = 1.6 $\mu$m (position crosses as in Figure 1). Contours are shown at (-3, -2, 2, 3, 4, 5, 6, 7, 8)$\times\sigma$ (1$\sigma$ = 49 $\mu$Jy beam$^{-1}$). Note that the HST image was not cleaned, and thus shows Airy rings around the unresolved quasar images (Peng et al. 2006; C. Y. Peng 2008, private communication). 

4. GRAVITATIONAL LENS INVERSION 

In a previous attempt to model the gravitational 
lensing effect toward the molecular gas reservoir of 
PSS J2322+1944, C03 adopted a generic model based on 
the strongly lensed z=0.84 radio galaxy MG 1131+0456. 
By comparing various source configurations in the as- 
sumed lens potential and based on plausibility argu- 
ments, a model was found that described the overall 
properties of the observed ring structure. To enable us 
to describe the intrinsic properties of PSS J2322+1944 in 
more detail, we here present a direct model reconstruc- 
tion and inversion of the lensing effect in this system. 
This modeling is based on the new CO observations pre- 
sented in the previous section. 

4.1. Method: Bayesian Inference 

Due to the remaining observational uncertainties, it is not possible to derive a unique solution for the inversion of the gravitational lens. We thus explored the parameter space permitted by the data (and our modeling assumptions) to find the best possible solution; if some property of the models is consistent throughout this permitted volume, it can be considered well-constrained at high confidence. This parameter study thus follows a Bayesian approach (Gregory 2005), using the Markov Chain Monte Carlo (MCMC) code by Brewer & Lewis (2006a). This algorithm does not simply aim at minimizing $\chi^2$ to find the best model (which may "overfit" the data by fitting part of the noise), but explores the whole range of plausible fits. By doing this, we can use a large number of parameters (e.g., source pixel values) in our model without overfitting the noise (since a broad region in parameter space with higher $\chi^2$ can outweigh a particular solution that has very low $\chi^2$ if the volume of the parameter space near the very low $\chi^2$ solution is very small). 

Fig. 3. - Channel maps of the CO($J$=2$\to$1) emission at 0.5" resolution (beam size is shown in the bottom left corner of the first panel). The same region is shown as in Fig. 2. One channel width is 6.25 MHz, or 42 km s$^{-1}$ [frequencies increase with channel number and are shown at 45016.35, 45022.60, 45028.85, 45035.10, 45041.35, 45047.60, and 45053.85 MHz (red to blue)]. Contours are shown at (-3, -2, 2, 3, 4, 5, 6)$\times\sigma$ (1$\sigma$ = 80 $\mu$Jy beam$^{-1}$). Note that the noise in channel 7 is intrinsically higher by a factor of $\sqrt{2}$ relative to the other channels. 

The Bayesian analysis thus encodes the phase space of 
possible source distributions allowed by the data into a 
probability distribution (e.g., Gregory 2005). The lensed 
image of the source was reconstructed based on the in- 
tegrated CO emission line map and the structure de- 
tected in the velocity channels, and then used to derive 
a common model of the lensing galaxy's projected den- 
sity profile that reproduces the emission in all channels 
simultaneously. Due to the differential structure among 
the velocity channels, this implies that the reconstructed 
source components after lens inversion will be different 
for each velocity channel, and will reproduce the velocity 
gradient across the source. 

In this model description, the unknowns to be inferred from the data are the seven source profiles $\{s_i\}_{i=1}^{7}$, one for each velocity channel, where $s_i$ is shorthand for a large number of (unknown) pixel values. The unknown lens model parameters are denoted collectively by $L$. Given observed data $D$, the posterior distribution for the unknown parameters is proportional to the product of the prior distribution and the likelihood function: 

$$p(\{s_i\}, L \mid D) \propto p(\{s_i\}, L)\, p(D \mid \{s_i\}, L). \qquad (1)$$

Here, $D$ consists of seven extended images (one for each velocity channel). We make the following standard assumption for $p(D \mid \{s_i\}, L)$: Assuming that we know the source and lens properties, we would predict the observed image to be the lensed, blurred image of that source, plus additive Gaussian noise: 

$$p(D \mid \{s_i\}, L) \propto \exp\left[-\frac{1}{2} \sum_{i=1}^{7} \chi_i^2\right], \qquad (2)$$

where 

$$\chi_i^2 = \sum_{j=1}^{N} \left(\frac{D_{ij} - M_j(s_i, L)}{\sigma_i}\right)^2. \qquad (3)$$

Here, $D_{ij}$ is the $j$'th of the $N$ pixels of the $i$'th image, $M(s_i, L)$ is the model image calculated by lensing and blurring the $i$'th source with the proposed lens model $L$, and $\sigma_i$ is the noise standard deviation in the $i$'th image estimated from the outer 'blank sky' regions of each image, i.e., distant from any detected structure. 
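
The likelihood of Equations (2) and (3) can be sketched in a few lines. This is a toy illustration only, not the MCMC code used for the analysis: images are flat lists of pixel values, the model images are supplied directly instead of being produced by lensing and blurring a source, and all numbers are made up.

```python
def chi_squared(data, model, sigma):
    """Equation (3): sum over pixels of ((D_ij - M_j) / sigma_i)^2 for one channel."""
    return sum(((d - m) / sigma) ** 2 for d, m in zip(data, model))

def log_likelihood(channels, models, sigmas):
    """Equation (2) up to an additive constant: -0.5 * sum over channels of chi_i^2."""
    return -0.5 * sum(chi_squared(d, m, s)
                      for d, m, s in zip(channels, models, sigmas))

# Two toy "channels" of three pixels each.
channels = [[1.0, 2.0, 3.0], [0.5, 0.0, -0.5]]
models = [[1.0, 2.5, 3.0], [0.0, 0.0, 0.0]]
sigmas = [0.5, 0.5]

print(log_likelihood(channels, models, sigmas))
```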

In this study, the lens was parameterized as a singular isothermal elliptical potential with five free parameters, which are the strength $b$ of the lens, the ellipticity $q$ of the potential, the central position $(x_c, y_c)$ of the lens, and the angle of orientation $\theta$ of the projected density profile. For this model, the lensing potential is 

$$\phi(x, y) = b \sqrt{q x'^2 + y'^2/q}, \qquad (4)$$

where 

$$\begin{pmatrix} x' \\ y' \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} x - x_c \\ y - y_c \end{pmatrix}. \qquad (5)$$

The source plane position $(x_s, y_s)$ corresponding to any lens plane position $(x, y)$ is then given by the lens equation: 

$$x_s = x - \frac{\partial\phi}{\partial x}, \qquad y_s = y - \frac{\partial\phi}{\partial y}. \qquad (6)$$
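
The ray-tracing step defined by Equations (4)-(6) can be sketched as follows. The gradient of the potential is taken analytically in the rotated frame and rotated back to the sky frame. The function names are illustrative, and the axis from which $\theta$ is measured is an assumption here, since the convention is not spelled out in the text.

```python
import math

def siep_deflection(x, y, b, q, xc, yc, theta):
    """Gradient of phi = b * sqrt(q x'^2 + y'^2 / q) with respect to (x, y)."""
    c, s = math.cos(theta), math.sin(theta)
    dx, dy = x - xc, y - yc
    xp = c * dx + s * dy           # Equation (5): rotate into the lens frame
    yp = -s * dx + c * dy
    r = math.sqrt(q * xp**2 + yp**2 / q)
    if r == 0.0:
        return 0.0, 0.0            # the potential is singular at the lens center
    dphi_dxp = b * q * xp / r
    dphi_dyp = b * yp / (q * r)
    # Chain rule back to the sky frame.
    return c * dphi_dxp - s * dphi_dyp, s * dphi_dxp + c * dphi_dyp

def source_position(x, y, b, q, xc, yc, theta):
    """Equation (6): map a lens-plane position to the source plane."""
    ax, ay = siep_deflection(x, y, b, q, xc, yc, theta)
    return x - ax, y - ay

# Lens parameters as reported in Section 4.4.1 (arcsec; theta in radians).
b, q, xc, yc = 0.745, 0.969, 0.074, -0.109
theta = math.radians(109.2)
print(source_position(0.8, 0.0, b, q, xc, yc, theta))
```

For a circular lens ($q = 1$) the deflection has constant magnitude $b$, so any point on the circle of radius $b$ around the lens center maps back onto the center, which is a convenient sanity check of the implementation.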



The optical position of the lensing galaxy was not used 
as an initial model constraint to allow for a conservative 
treatment of the errors. However, the optical data is used 
in a second step to better constrain some of the source's 
intrinsic properties, as described in more detail below. 

4.2. Application to Interferometric Data 

Interferometer maps are reconstructed from visibility data using a scale that samples one synthesized interferometer beam (i.e., resolution element) with multiple pixels. This means that the noise is not independent for all pixels, as assumed by Equation (3). Correct modeling of correlated noise is computationally very expensive, and makes the evaluation of the likelihood (Equation 2) very slow. In interferometric images, the scale of the synthesized beam, or point-spread function (PSF), usually is the same as the noise correlation scale (depending on the interferometer baseline weighting function used in the imaging Fourier transform). A good PSF model thus will allow for a proper, but faster treatment of the noise properties. 

The correlation length scale of the noise was measured at a clear distance from the detected molecular structure and used as the length scale for constructing an optimized Gaussian PSF. This length scale predicts that 1/62 of the pixels are effectively independent at the scale of the images (0.03" pixel$^{-1}$). This corresponds to 106 pixels on the scale of the Einstein ring. To not "overfit" the image due to noise, only this fraction of information can be used to computationally determine the lens properties. This means that MCMC calculations have to be run at an "annealing temperature" of 62, or alternatively, the $\sigma$'s can be artificially increased by a factor $\sqrt{62}$ (e.g., Gregory 2005). The validity of this short-cut procedure was confirmed by ensuring that the residuals of the model images have the same statistical properties as the background in the observed maps. 
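
The two options named above are equivalent for a Gaussian likelihood: dividing $\chi^2$ by a temperature $T$ gives the same log-likelihood as multiplying every $\sigma$ by $\sqrt{T}$. A short numerical check, with made-up residuals and illustrative function names:

```python
import math

def tempered_log_likelihood(chi2, temperature):
    """Log-likelihood flattened by an annealing temperature: -chi^2 / (2 T)."""
    return -0.5 * chi2 / temperature

def inflated_sigma_chi2(residuals, sigma, factor):
    """chi^2 evaluated with the noise level multiplied by `factor`."""
    return sum((r / (sigma * factor)) ** 2 for r in residuals)

residuals = [0.1, -0.2, 0.05, 0.3]   # made-up pixel residuals
sigma = 0.08
chi2 = sum((r / sigma) ** 2 for r in residuals)

tempered = tempered_log_likelihood(chi2, 62.0)
inflated = -0.5 * inflated_sigma_chi2(residuals, sigma, math.sqrt(62.0))
print(tempered, inflated)   # the two values agree up to floating-point rounding
```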

4.3. Priors 

The prior distribution for the lens parameters is chosen to be diffuse, but the final result is independent of this choice. For the seven unknown sources representing the source distribution in the velocity channels, we use independent "massive inference" priors (Brewer & Lewis 2006b). To generate a random source, a moderate number of "atoms" of a certain brightness are added to a blank source plane. These atoms are given a uniform probability distribution in position and an exponential probability distribution in flux. The width of each atom in pixels is chosen at random from one of three values to allow for the expectation that pixel values should be correlated with their neighbours. This procedure generates a random source in which most pixels are dim, and is a more appropriate prior for astronomical sources than most conventional regularizers (Brewer & Lewis 2006a). 
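
A draw from such an atom-based prior might look like the sketch below. It is only meant to illustrate the description above and does not reproduce the implementation of Brewer & Lewis: the grid size, the number of atoms, the mean flux, the three widths, and the square shape of each atom are all assumed values chosen for the example.

```python
import random

def random_source(n_pix=32, n_atoms=10, mean_flux=1.0, widths=(1, 2, 4), seed=None):
    """Draw one source-plane image from an atom-based prior (illustrative defaults)."""
    rng = random.Random(seed)
    image = [[0.0] * n_pix for _ in range(n_pix)]
    for _ in range(n_atoms):
        x0 = rng.randrange(n_pix)                 # uniform in position
        y0 = rng.randrange(n_pix)
        flux = rng.expovariate(1.0 / mean_flux)   # exponential in flux
        w = rng.choice(widths)                    # one of three widths, in pixels
        # Spread the atom's flux evenly over a w x w block, clipped at the edges.
        cells = [(x, y)
                 for x in range(x0, min(x0 + w, n_pix))
                 for y in range(y0, min(y0 + w, n_pix))]
        for x, y in cells:
            image[y][x] += flux / len(cells)
    return image

source = random_source(seed=1)
n_lit = sum(1 for row in source for value in row if value > 0.0)
print(f"{n_lit} of {32 * 32} pixels carry flux")
```

Because only a handful of atoms are placed, most pixels stay at exactly zero, which is the "most pixels are dim" property described in the text.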

4.4. Modeling Results 
4.4.1. Lens Parameters 

From the MCMC run, we find that the lens has a strength b = 0.745" ± 0.014", an ellipticity q = 0.969 ± 0.014, and a position ($x_c$ = 0.074" ± 0.024"; $y_c$ = -0.109" ± 0.029") [coordinates are relative to the center of the model images at $\alpha$ = 23h 22m 07M76, $\delta$ = +19°44'22.16"]. Because q ~ 1, the lens potential is close to circular, as is the projected lens density profile. Even though the observed position of the lens ($x_c^{\rm obs}$ = 0.105"; $y_c^{\rm obs}$ = -0.080") was not taken as an input parameter, the model naturally reproduces its position within the errors. The posterior probability distribution for $\theta$ is bimodal; with a probability of 62% (38%), the angle of orientation of the potential is $\theta$ = 109.2° ± 7.1° (62.7° ± 6.5°). The lens parameters are constrained well by the molecular data alone, so the marginal posterior distributions are close to Gaussian (except for the bimodal $\theta$ distribution, which is well approximated by a mixture of two Gaussians). Thus, all of the estimates and uncertainties quoted above are of the form (mean ± standard deviation). 

The lensing model described above was derived based 
on the distribution of the lensed CO emission only to 
allow for a conservative treatment of the errors, and to 
avoid a main systematic source of error: the remaining 
astrometric uncertainties between the optical and radio 
reference frames. The results of our study indicate that 
the optical position of the lens is reproduced well, and 
thus, that the astrometric offset appears to be small. Due 
to the fact that gravitational lensing is achromatic, we 
thus can use the model derived based on the distribu- 
tion of the lensed CO emission only to also constrain the 
intrinsic optical properties of the source. We also can 
use the positions of the lensed images of the quasar in 
the optical to further constrain the allowed parameter 
space, and to estimate the position of the AGN within 
the deprojected molecular gas reservoir. 

Based on the sample of lenses that fit the molecular line 
data, the optical positions of the quasar were ray-traced 
back into the source plane. As the optical emission of 
the source is compact, only those models that map both 
quasar images onto the same position in the source plane 
within the errors can be considered valid. We thus dis- 
carded all models that did not fulfill this extra criterion. 

We find that this extra constraint only marginally 
changes previous results for the strength, ellipticity, and 
position of the lens. However, it does impact the solution 
for the angle of orientation of the potential, and 
supports $\theta$ ~ 109°. As the source reconstructions presented 
in this section are not strongly sensitive to the 
extra constraint, only results (and the more conservative 
errors implied) from simulations produced without using 
the optical data are shown unless stated otherwise. 

4.4.2. Source Profiles 

For the seven source profiles in the different velocity 
channels, a "best" estimate was obtained by taking the 
average of all sources encountered by the MCMC run. 
This gives the posterior mean for each source, which is an 
optimal estimate, as it minimizes the expected squared 
error. As no unique solution exists for the lens inversion, 
the full source sample has a certain diversity. However, 
by taking the mean, only reproducible features are re- 
tained from the full sample. Due to the fact that both the 
resolution and sensitivity of the observations are finite, 
the resulting surface brightness profiles are expected to 
be smooth on a certain critical scale. Substructure may 
appear on smaller scales, but will be smoothed out in 
the final model due to the larger uncertainties involved 
in reproducing such small structures across the permit- 
ted volume in parameter space. 
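
The behaviour described in this paragraph can be illustrated with a toy example. The sketch averages a few made-up source samples pixel by pixel and compares the mean squared error of the posterior mean with that of a single sample. The function names and sample values are illustrative only.

```python
def posterior_mean(samples):
    """Pixel-by-pixel mean of equally weighted source samples (flat lists)."""
    n = len(samples)
    return [sum(pixels) / n for pixels in zip(*samples)]

def mean_squared_error(estimate, samples):
    """Average squared distance between an estimate and the samples."""
    return sum(sum((e - p) ** 2 for e, p in zip(estimate, s))
               for s in samples) / len(samples)

# Three made-up samples of a four-pixel source: the first pixel is bright in
# every sample, while a faint feature appears in a different pixel each time.
samples = [[1.0, 0.3, 0.0, 0.0],
           [1.1, 0.0, 0.3, 0.0],
           [0.9, 0.0, 0.0, 0.3]]

mean = posterior_mean(samples)
print(mean)
print(mean_squared_error(mean, samples), mean_squared_error(samples[0], samples))
```

The reproducible bright pixel survives the averaging, whereas the wandering faint feature is diluted, and the mean has a smaller expected squared error than any individual sample.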

In Figure 4, the model-reconstructed (posterior mean) molecular gas distributions in the seven CO velocity channels (columns) are shown in the lens plane (middle row), and after lens inversion (top row), together with the observations (not convolved, i.e., at full spatial resolution; bottom row). Since the lens model is common to all velocity channels, the changing distribution of molecular gas between the velocity channels is due to intrinsic velocity structure in the background source (note the difference in scale between the lensed and the unlensed source). 

Fig. 4. - Reconstructed model images (posterior mean) of the 42 km s$^{-1}$ CO($J$=2$\to$1) velocity channels, and source morphologies after lens inversion. The bottom row shows the observed CO($J$=2$\to$1) emission in the velocity channels as shown in Figure 3, but without applying a beam convolution (i.e., same resolution as Figure 1, thus 1$\sigma$ = 77 $\mu$Jy beam$^{-1}$). The middle row shows the model-reconstructed images of the channel maps in the lens plane (same scale as bottom). The top row shows the reconstructed images in the source plane after lens inversion. To account for the magnification of physical scales (and thus in effective spatial resolution), the model reconstruction in the source plane (same region as shown in Figure 6; scale in arcsec given on top) is shown zoomed-in by a factor of 2 relative to the lens plane. 



In Figure 5, a color composite map of the modeled distribution of the CO in the velocity channels is shown. Orange corresponds to the redshifted part of the emission line, green corresponds to the central part, and blue corresponds to the blueshifted part. In the right panel, the image in the lens plane is displayed, and in the left panel, the image in the source plane is shown after lens inversion. A clear velocity gradient is seen in both images. The brightest part of the emission shows a systemic velocity structure that would be in agreement with a highly inclined, almost edge-on rotating disk of ~2.5 kpc radius within the uncertainties of the data and modeling. This structure encloses a dynamical mass of $M_{\rm dyn} = 4.4 \times 10^{10}\,\sin^{-2} i\,M_\odot$ (assuming a linewidth of 273 km s$^{-1}$ as given above), corresponding to only 2.5 times the lensing-corrected molecular gas mass (for highly inclined dynamical structures). However, there is also indication for a second, possibly tidal component (blue/green). This structure is slightly bent, and also has an extent of almost 5 kpc, similar to the brighter component. Each component extends over at least 4-5 resolution elements in the source plane, and thus is clearly detected and resolved. 
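
The quoted dynamical mass is consistent with the simple estimator $M_{\rm dyn}\sin^{2} i = R\,\Delta v^{2}/G$. The text does not state which estimator was used, so taking $\Delta v$ to be the full CO FWHM is an assumption made for this sketch; the gas mass uses the CO magnification $\mu_L = 5.34$ reported later in this section.

```python
G = 4.301e-3   # gravitational constant in pc Msun^-1 (km/s)^2

def dynamical_mass(radius_pc, velocity_kms):
    """Enclosed mass M sin^2(i) = R v^2 / G of a rotating structure, in Msun."""
    return radius_pc * velocity_kms**2 / G

m_dyn = dynamical_mass(2500.0, 273.0)   # ASSUMPTION: v is the CO FWHM
m_gas = 9.0e10 / 5.34                   # apparent gas mass over the CO magnification
print(f"M_dyn sin^2 i = {m_dyn:.2e} Msun")
print(f"M_gas         = {m_gas:.2e} Msun")
print(f"M_dyn / M_gas = {m_dyn / m_gas:.1f}")
```

Under this assumption the estimate lands within a few percent of the quoted $4.4 \times 10^{10}\,M_\odot$, and the ratio to the gas mass is close to the factor of 2.5 given in the text.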

The remaining uncertainties in the source reconstruc- 
tion can be quantified by examining individual MCMC 
samples to calculate a probabilistic description of differ- 
ent properties of the source. A manual comparison of 
models considered to be "good" by our statistical analy- 
sis however shows that all share the same principal fea- 
tures, and only show significant differences in substruc- 
ture on scales that are close to the resolution and de- 
tection limit, as expected. It thus is plausible that the 
overall structure and velocity gradient along the recon- 
structed source are real. 

To quantify the uncertainties in the structure of the molecular gas reservoir of PSS J2322+1944 in the source plane, the posterior probability P that the flux in each pixel is nonzero is shown in Figure 6. The detection probability is shown over the full velocity range (i.e., summed up over all velocity channels). Due to the fact that observational data contains noise (i.e., finite positive or negative flux in every pixel of any velocity channel), the detection probability is nonzero in every pixel. Thus, a detection probability below a certain threshold has to be considered "background". The detection probability is consistently significantly higher than the background along the structures predicted by the "best model" (see Figures 4 and 5). 
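
A per-pixel detection probability of this kind can be estimated as the fraction of MCMC samples in which the pixel carries flux. The sketch below uses four made-up samples of a four-pixel source; the function name and the `threshold` parameter are illustrative.

```python
def detection_probability(samples, threshold=0.0):
    """Fraction of posterior samples in which each pixel's flux exceeds `threshold`."""
    n = len(samples)
    return [sum(1 for value in pixels if value > threshold) / n
            for pixels in zip(*samples)]

# Made-up source samples (flat lists of pixel fluxes).
samples = [[1.0, 0.0, 0.2, 0.0],
           [0.8, 0.0, 0.0, 0.0],
           [1.2, 0.1, 0.0, 0.0],
           [0.9, 0.0, 0.3, 0.0]]

prob = detection_probability(samples)
print(prob)
```

In a real run the number of samples is large, so pixels away from the source end up with a low but nonzero probability that defines the "background" level.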

The overall lensing magnification of the CO emission as predicted by
the posterior distribution for the lens model is
$\mu_L = 5.34 \pm 0.34$. Figure 7 shows the differential flux
magnification between the velocity channels, estimated by calculating
flux(lensed image)/flux(source) for the MCMC samples of each velocity
channel. Although the differential magnification is consistent with
unity within the modeling uncertainties, there is tentative evidence
that the flux is more strongly magnified in the line center than in the
outer channels. If real, this would suggest that the profile of the
CO($J$=2$\to$1) emission line is distorted by the lensing effect.
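
The per-channel estimate described above can be sketched in Python as
follows. The input layout (one integrated flux per MCMC sample and
velocity channel), the function names, and the sign convention of the
magnitude conversion are assumptions of this example rather than
details taken from the modeling code.

```python
import numpy as np


def channel_magnification(lensed_flux, source_flux):
    """Mean and scatter of flux(lensed image)/flux(source) per channel.

    lensed_flux, source_flux: arrays of shape (n_samples, n_channels)
    with integrated fluxes for each MCMC sample (hypothetical layout).
    """
    ratio = np.asarray(lensed_flux, float) / np.asarray(source_flux, float)
    return ratio.mean(axis=0), ratio.std(axis=0, ddof=1)


def differential_magnitudes(mu_channel, mu_total):
    """Channel magnification relative to the total, in magnitudes.

    Assumed convention: 2.5 log10(mu_channel / mu_total), so that a
    channel magnified exactly like the integrated line gives 0.
    """
    return 2.5 * np.log10(np.asarray(mu_channel, float) / mu_total)
```

Taking the scatter over samples gives a simple per-channel error bar,
which is what makes the comparison with unity meaningful.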

Fig. 5. RGB composite color map of the model-reconstructed
CO($J$=2$\to$1) velocity channels shown in Figure 4, with three colors
encoding the velocity range of the emission [orange: redshifted
(channels 1-2), green: central (channels 3-5), blue: blueshifted
(channels 6-7)]. The grey lines indicate the caustics. The bar at the
bottom indicates the scale in the source plane. Right: Lensed, blurred
image of the Einstein ring in the lens plane. Left: Image of the
model-reconstructed quasar host galaxy in the source plane.

The model-predicted lensing magnification of the 314 nm continuum
emission from the AGN, as derived from models based on both optical and
CO data, is $\mu_L^{\rm opt} = 4.7 \pm 0.4$. Excluding the optical
constraints would result in slightly larger errors:
$\mu_L^{\rm opt} = 4.8 \pm 0.6$. The overlay in Figure 2 suggests that
the optical AGN is offset from the peak position of the molecular line
emission. In the red CO line wing, the northern peak of the Einstein
ring is brighter (as is the case for the two optical images), while in
the blue line wing, the southern peak is brighter. This, together with
the optical quasar image positions relative to the lens position,
suggests that the AGN is located close to the orange peak of the
reconstructed molecular gas distribution shown in Figure 5. This
finding is supported by the model-based inversion of the optical and CO
data, although the determination of the exact location within that part
of the molecular structure is limited by the remaining model
uncertainties and the limited accuracy of the relative astrometry of
the optical and radio data. If the AGN is indeed located in the upper
part of the disklike structure rather than in the center, one may
speculate that the spatial and velocity structure of the reconstructed
source is more likely to be due to interaction than to a rotating disk.

5. DISCUSSION 

We have imaged and modeled a molecular Einstein ring of a galaxy at
$z$=4.12. Our high resolution CO($J$=2$\to$1) maps of the lensed quasar
host galaxy of PSS J2322+1944 (a double image optical quasar) reveal
spatially resolved structure that shows a clear velocity gradient in
the CO emission line. By performing a model-based lens inversion of the
Einstein ring that is consistent with the data, we are able to
reconstruct the velocity structure of this distant quasar host galaxy.
The gravitational lensing effect acts as a natural telescope, and
allows us to zoom in on the molecular gas reservoir down to linear
scales of only <1 kpc, sufficient to reveal velocity structure over
almost 10 resolution elements in the source plane. Our novel modeling
of this system reveals how the molecular gas crosses the central
caustic (causing the appearance of the Einstein ring) moving from the
redshifted to the blueshifted molecular emission. We also find evidence
that the optical quasar may be associated with the redshifted part of
the molecular reservoir. The full reservoir has a mass of
$M({\rm H_2}) = 1.7 \times 10^{10}\,M_\odot$ (corrected for lensing
magnification). The molecular gas mass alone could account for almost
half of the dynamical mass in this system if the galaxy were to be seen
close to edge-on ($M_{\rm dyn} \sin^2 i / M({\rm H_2}) \simeq 2.5$).
Because of the large spatial extent of the CO emission, and because the
AGN is probably substantially offset from the center of the reservoir,
we conclude that the molecular gas and dust are likely heated
predominantly by star formation.

From the FIR luminosity of the source in the adopted cosmology
($L_{\rm FIR} = 2.4 \times 10^{13}\,\mu_L^{-1}\,L_\odot$; Cox et al.
2003), we derive a star formation rate$^{11}$ (SFR) of
680 $M_\odot$ yr$^{-1}$. At least part of the CO emission of the
reconstructed source does not appear to follow a systemic trend in
velocity. In this picture, this structure may be due to interaction,
possibly caused by a major merger. Such an event could both feed the
AGN and fuel the starburst, and thus be responsible for the coeval
assembly of a supermassive black hole and the stellar bulge in this
system. Future observations of the FIR continuum at comparable spatial
resolution may shed more light on this situation.
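
The star formation rate quoted above can be checked with a few lines of
Python. The sketch assumes that the CO magnification factor of 5.34 is
also appropriate for the FIR continuum, which is an assumption of this
example; the conversion factor is the Kennicutt (1998) scaling given in
footnote 11, and the function and constant names are illustrative only.

```python
MU_L = 5.34              # CO lensing magnification (assumed to apply to the FIR)
L_FIR_APPARENT = 2.4e13  # apparent FIR luminosity in L_sun, before correction
SFR_PER_LFIR = 1.5e-10   # M_sun/yr per L_sun (Kennicutt 1998 scaling)


def star_formation_rate(l_fir_apparent, magnification, coeff=SFR_PER_LFIR):
    """Magnification-corrected SFR in M_sun/yr from an apparent L_FIR in L_sun."""
    if magnification <= 0:
        raise ValueError("magnification must be positive")
    return coeff * l_fir_apparent / magnification


if __name__ == "__main__":
    sfr = star_formation_rate(L_FIR_APPARENT, MU_L)
    print(f"SFR = {sfr:.0f} M_sun/yr")
```

With these inputs the result lies within about 1% of the quoted
680 $M_\odot$ yr$^{-1}$; the small difference depends on how the
magnification is rounded.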

$^{11}$ Assuming
SFR $= 1.5 \times 10^{-10}\,L_{\rm FIR}\,(M_\odot\,{\rm yr}^{-1}/L_\odot)$
(Kennicutt 1998). The dust temperature and spectral index of
PSS J2322+1944 indicate that the FIR continuum emission is dominated by
star formation, in agreement with the finding that the source follows
the radio-FIR correlation for star-forming galaxies (Beelen et al.
2006).

Fig. 6. Map of the detection probability P(pixel > 0) in the source
plane after lens inversion, as derived from the full MCMC parameter
study. A region of 1.21" × 1.21" size is shown.

Motivated by these results, the dynamical mass derived from the
molecular line observations in the host galaxy of the $z$=4.12 quasar
PSS J2322+1944 can be used in an attempt to constrain the relationship
between the central SMBH mass and the stellar bulge mass
($M_{\rm BH}$-$M_{\rm bulge}$) in high-$z$ AGN galaxies. Such a
relation has been proposed for different types of galaxies in the local
universe, and appears to hold over more than three orders of magnitude
in SMBH mass, essentially independent of galaxy type (predicting
$M_{\rm bulge} \simeq 700\,M_{\rm BH}$; e.g., Kormendy & Gebhardt
2001). Assuming the optical lensing factor of $\mu_L^{\rm opt} = 4.7$
derived from our model of PSS J2322+1944 and the Eddington limit
derived in Section 3 gives $M_{\rm BH} = 1.5 \times 10^{9}\,M_\odot$.
With the further assumption that the dynamical molecular structure is
seen close to edge-on, and that it traces a major fraction of the
gravitational potential that hosts the stellar bulge, we thus find that
$M_{\rm bulge} \simeq 30\,M_{\rm BH}$ (subtracting out the black hole
and gas masses would even give $M_{\rm bulge} < 20\,M_{\rm BH}$). This
value is offset by more than an order of magnitude from the local
$M_{\rm BH}$-$M_{\rm bulge}$ relation, but in good agreement with
results obtained for other high-$z$ quasars, which show similar or
larger offsets (e.g., Walter et al. 2004; Weiß et al.; Riechers et al.
2007; see also Shields et al. 2006).
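
The mass ratios in the preceding paragraph follow from simple
arithmetic, sketched here in Python. The edge-on dynamical mass is the
value $M_{\rm dyn} \sin^2 i = 4.4 \times 10^{10}\,M_\odot$ from the
abstract; treating the full dynamical mass as an upper limit on the
bulge mass is the same simplification made in the text, and the
function and constant names are illustrative only.

```python
M_DYN_EDGE_ON = 4.4e10  # M_dyn * sin^2(i) in M_sun (value from the abstract)
M_BH = 1.5e9            # black hole mass in M_sun
M_H2 = 1.7e10           # molecular gas mass in M_sun
LOCAL_RATIO = 700.0     # local M_bulge / M_BH (Kormendy & Gebhardt 2001)


def bulge_to_bh_ratio(m_dyn, m_bh, m_gas=0.0, subtract_dark=False):
    """Ratio of the (upper limit on the) bulge mass to the black hole mass.

    With subtract_dark=True the gas and black hole masses are removed
    from the dynamical mass before taking the ratio.
    """
    m_bulge = m_dyn - m_gas - m_bh if subtract_dark else m_dyn
    return m_bulge / m_bh


if __name__ == "__main__":
    gross = bulge_to_bh_ratio(M_DYN_EDGE_ON, M_BH)
    net = bulge_to_bh_ratio(M_DYN_EDGE_ON, M_BH, M_H2, subtract_dark=True)
    print(f"M_bulge/M_BH (edge-on): {gross:.1f}")
    print(f"after subtracting gas and black hole: {net:.1f}")
    print(f"offset from local relation: factor {LOCAL_RATIO / gross:.0f}")
```

The gross ratio comes out just under 30 and the net ratio below 20,
which corresponds to an offset of more than a factor of 20 from the
local value of 700.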

Fig. 7. Model-predicted differential gravitational magnification
between the CO velocity channels shown in Figures 5 and 3. The
horizontal axis gives the CO(2-1) velocity channel
($\Delta v = 42$ km s$^{-1}$ per channel). The magnification is shown
in magnitudes (i.e., no magnification corresponds to a value of 0). The
error bars include the modeling uncertainties. The numbers on top
indicate the magnification in each velocity channel. The dashed line
indicates the total magnification of $\mu_L = 5.34$ in the integrated
emission line map.

In the case of PSS J2322+1944, one may attempt to account for this
offset by assuming that the dynamical structure is seen close to
face-on (i.e., $i < 12^\circ$). This would, however, predict a large
intrinsic CO linewidth ($\Delta v_{\rm FWHM} > 1300$ km s$^{-1}$). Such
a linewidth would significantly exceed the velocity dispersions
observed in the spheroids of massive present-day elliptical galaxies
(which PSS J2322+1944 will likely evolve into), but is of the same
order of magnitude as those observed toward some high-$z$ submillimeter
galaxies (e.g., Carilli & Wang 2006). Such large molecular linewidths
are consistent with those predicted by simulations of the hierarchical
buildup of massive quasar host galaxies at high redshift (e.g.,
Narayanan et al. 2008), and thus compatible with the possible merger
nature of PSS J2322+1944. However, in interacting or merging systems,
molecular lines are likely broader because the dynamical molecular
structure is not yet fully virialized. The molecular line widths in
these galaxies may thus be higher by a factor of a few than the actual
virial velocity of the host halo, which would lead to an overprediction
of the bulge mass. It thus appears difficult to explain the offset from
the local $M_{\rm BH}$-$M_{\rm bulge}$ relation by simply assuming a
small inclination angle toward the line of sight. Also, assuming that
the AGN accretes at sub-Eddington rates, and/or taking into account
that more than a third of the dynamical mass derived above is likely
not stellar, but accounted for by gas, dust (<1% of the gas mass), and
the black hole, further increases this offset. Together with previous
such examples, our results for PSS J2322+1944 thus appear to indicate
that the black holes in massive galaxies at high redshift assemble
earlier than a large fraction of their stellar bulges.
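
The inclination limit discussed above can be reproduced with a short
Python sketch. It asks how small $\sin^2 i$ would have to be for
$M_{\rm dyn} = 4.4 \times 10^{10} \sin^{-2} i\,M_\odot$ to reach
$700\,M_{\rm BH}$, and what factor $1/\sin i$ this implies between the
observed and intrinsic linewidth. Equating the dynamical mass with the
bulge mass and deprojecting the linewidth with $1/\sin i$ are
simplifying assumptions of this example, and the function names are
illustrative only.

```python
import math

M_DYN_EDGE_ON = 4.4e10  # M_dyn * sin^2(i) in M_sun (value from the abstract)
M_BH = 1.5e9            # black hole mass in M_sun
LOCAL_RATIO = 700.0     # local M_bulge / M_BH


def max_inclination_deg(m_dyn_sin2i, m_bh, local_ratio=LOCAL_RATIO):
    """Inclination (degrees) at which M_dyn equals local_ratio * M_BH."""
    sin2_i = m_dyn_sin2i / (local_ratio * m_bh)
    if not 0.0 < sin2_i <= 1.0:
        raise ValueError("no inclination satisfies the requested ratio")
    return math.degrees(math.asin(math.sqrt(sin2_i)))


def deprojection_factor(inclination_deg):
    """Factor 1/sin(i) between the observed and intrinsic linewidth."""
    return 1.0 / math.sin(math.radians(inclination_deg))


if __name__ == "__main__":
    i_max = max_inclination_deg(M_DYN_EDGE_ON, M_BH)
    print(f"i < {i_max:.1f} deg")
    print(f"linewidth deprojection factor > {deprojection_factor(i_max):.1f}")
```

The required inclination comes out just below 12 degrees, which
corresponds to a linewidth deprojection factor of nearly 5.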

The observations and modeling presented herein demonstrate the power of
spatially and dynamically resolved molecular gas studies in strongly
lensed, distant AGN-starburst systems to provide direct evidence for
the scenarios of quasar activity and galaxy assembly in the early
universe as suggested by recent cosmological simulations (e.g.,
Springel et al. 2005). The boost in line intensity and spatial
resolution provided by Einstein ring lens configurations is currently
the only means by which to probe the dynamical structure of the most
distant star-forming galaxies at (sub-)kiloparsec resolution. Such
observations provide an important foundation for future observations of
molecular gas and dust in the early universe with the Atacama Large
Millimeter/submillimeter Array (ALMA), which will be able to probe more
typical galaxy populations at high redshift to comparable and higher
physical resolution, even without the aid of gravitational lensing.